126880 


TfflS  DOCUMENT  IS  BEST 
QUALITY  AVAILABLE.  THE  COPY 
FURNISHED  TO  DTIC  CONTAINED 
A  SIGNIFICANT  NUMBER  OF 
PAGES  WHICH  DO  NOT 
REPRODUCE  LEGIBLY. 


MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY 
LINCOLN  LABORATORY 


PACKET  SPEECH  SYSTEMS  TECHNOLOGY 


SEMIANNUAL  TECHNICAL  SUMMARY  REPORT 

TO  THE 

DEFENSE  ADVANCED  RESEARCH  PROJECTS  AGENCY 


1  APRIL  —  M  SEPTEMBDt  IMS 


Areessleo  Por 

JfT^S  OTAJtl 
MIC  TAB 
UiMinnounced 
Justification. 


ISSUED  S  PEBRUARY  198S 


n 


By- - 

l)lstrl>>ttt  1  oiV _ 

Availability  Codes 
■  lAvail  eod/or 
>ist  Spaelal 


Approved  for  public  idcMc;  diHilbMloB  unUnhcd. 


LEXINGTON 


MASSACHUSETTS 


ABSTRACT 

Thif  report  deacribee  work  perforawd  on  the  Packet 
Speech  Syateaa  Technology  Prograa  aponaored  by  the 
Infonaation  Proceaaing  Techniquea  Office  of  the 
Defenae  Advanced  Reaearch  Projecta  Agency  during 
the  period  1  April  through  30  8epte!ri>er  1982. 
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INTRODOCTION  AND  SUMHART 

The  loag-raage  object ivee  of  the  Packet  Speech  Syeteu  Technology 
Program  are  to  develop  and  demonatrate  techniquea  for  efficient  digital 
epee^h  communications  on  networks  suitable  for  both  voice  and  data,  and  to 
investigate  and  develop  techniques  for  integrated  voice  and  data 
comsHinication  in  packet ised  networks,  including  wideband  common-user 
satellite  links.  Specific  areas  of  concern  are:  the  concentration  of 
statistically  fluctuating  volumes  of  voice  traffic,  the  adaptation  of 

communication  strategias  to  varying  conditions  of  network  links  and  traffic 

\  ' 

volume,  and  the  interconnection  of  wideband  satellite  networks  to  terrestrial 
systesu. 

Previous  efforts  in  this  area  have  led  to  new  vocoder  structures  for 
improved  narroid>and  voice  performance  and  multiple-rate  transmission,  and  to 
demonstrations  of  conversational  speech  and  conferencing  on  the  AlPAHST  and 
the  Atlantic  Packet  Satellite  Network. 


The  current  program  has  two  major  thrusts:  i.e.,  the  development  and  , 
refinement  of  practical  low-cost,  robust,  narrowband,  and  variable-rate 
speech  algorithms  and  voice  terminal  Structures;  and  the  establishment  of  an 
experimental  wideband  satellite  network  to  serve  as  a  unique  facility  for  the 
realistic  investigation  of  voice/data  networking  strategies.  - 

This  report  covers  work  in  the  following  areas:  compact  vocoder 
developamnt  based  on  digital  LSI  technology;  developamnt  of  Packet  Voice 
Terminal  (PVT)  and  local  access  network  (LIZNBT)  facilities  for  experimsnts 
in  packet  voice;  development  and  ej^erimental  application  of  a  flexible 
internet  stream  gateway  (the  adaiceacnatrator)  for  packet  voice;  planning, 
coordination,  and  execution  of  aulci'«d*er  packet  speech  experiments  on  the 
experimental  wideband  satellite  network  (WB  8ATNBT);  and  design  and 
development  of  a  system  for  voice  control  of  network  voice  conferencing  in 
the  wideband  systam. 

Ten  single-card  2400-bps  LPC  vocodars,  using  factory-pr^rammsd  versions 
of  the  me  Signal-Processing  Interface  (SPI)  chips,  have  baen  constructad, 
debuggad,  and  are  operational  In  PVTs  in  tha  exp*vimeatal  wideband  network. 


A  coapact  eabedded  LPC  vocoder  syacea  which  operates  in  the  800-  to 
4800-bps  range,  and  can  be  realised  in  herdware  using  the  NEC  SPl  chips  as 
prograassed  for  the  2400-bps  coapact  LPC,  has  been  developed.  Both  five-rate 
and  three-rate  systeas  have  been  iapleacnted  in  real  tiae  on  the  Lincoln 
Digital  Signal  Processor  (LD8P).  Architectures  and  aicrocode  have  been 
developed  for  hardware  realisations  using  the  NEC  SPIs  and  an  INTEL  8085 
control  and  interface  aicrocoaputer. 

LEXNETs  and  PVTs  are  operating  at  four  wideband  network  sites  and  have 
been  used  in  a  variety  of  ezperiaents  and  deaonst rat ions.  A  draft 
specification  has  been  written  for  production  of  PVTs  by  an  industrial 
contractor.  PVT  conferencing  software,  including  robustness  features  and 
three  wethods  of  conference  setup,  is  operational.  A  weasurewent  host 
capability,  including  a  tiwer  card  to  alljew  precise  cross-net  packet  delay 
■easureaents  based  on  a  global  tiae  source,  has  been  added  to  the  PVT. 

Miniconcent rator  gateway  hardware  has  been  running  reliably  at  four 
sites  throughout  this  reporting  period.  Several  aajor  extensions  to  the 
PDP-11  gateway  prograa  have  been  added,  in  the  areas  of  speech  conferencing, 
robustness,  aeasureaant  and  instruaentation  facilities,  and  external  control 
of  the  gateway.  The  gateways  were  a  aajor  eleaent  in  deaonst rat ions  of 
internet  packet  speech  and  conferencing,  as  well  as  in  a  large  set  of 
experiasnts  and  perforaance  validations  on  the  wideband  systea. 

Lincoln  carried  out  experiasnt  planaing  and  coordination,  as  well  as 
developasnt  of  PVTs,  LIENITs,  and  gateways,  for  a  3  June  dsswnstration  at  the 
final  DAEPA  narrowband  packet  spaach  prograa  asating.  The  daa»astratioo, 
which  in  itself  was  a  aajor  expariasntal  ailastona  for  the  wideband  systea, 
involved  the  Lincoln,  181,  and  SEX  sites.  Point-to-point  and  conference 

calls  usii«  2400-bps  LPC,  16-  to  64-kbps  sabsddsd  CV8D,  and  64-kbps  PCM  ware 
dssnastrated.  Calls  of  particular  iatarast  iaclwdad:  (1)  a  PCM  call  froa  a 
LBXNBT  PVT  at  Lincoln  and  to  the  local  Los  Angelas  weather  oubbar  through  a 
switched  telephone  network  interface  (STNX)  at  181;  (2)  a  3-site,  4-party  LPC 
conference  using  single-card  ceagaet  LPCs  in  PVTs  oa  UtEmis  at  Lincoln  and 
181,  and  a  CRl-V  LPC  on  a  packet  radio  not  (PMET)  unit  at  I8X;  and  (3)  a 
call  between  an  LPC  on  a  Lincoln  LUMIT,  and  an  tU  LPC  in  a  a^ile  van 


wUi 


PRMET  unit,  d«aon«traeing  aoblle  PVXBT  spnnch  and  alternate  packet  routing  aa 
the  van  traveled  in  the  Palo  Alto  area. 

The  voice-controlled  operator  (VCOP)  haa  been  inpleiaented  and  teated. 

LPC  and  PCM  conferencea  involving  tuo  to  four  participanta  have  been 
aucceaafully  aet  up  by  voice  betucen  two  LBCMETa  connected  through  a  gateway. 
The  conference  initiator  aeta  up  the  conference  by  calling  VCOP  from  any  PVT 
and  carrying  on  a  voice  dialogue  atructured  by  aeana  of  voice  prowpta  from 
VCOP. 


PACKET  SPEECH  SYSTStS  TECHMOLOGT 


I.  COMPACT  LPC  VOCODER  DEVELOPMEMT 

In  Che  previous  sesiiannuel,  it  was  reported  that  a  pair  of  single-card 
2.4-kbps  PVT-coapaCible  LPC  vocoders  had  been  deaonstraCed  using  EPROM 
versions  of  Che  Signal-Processing  Interface  (SPI)  chips  ()iPD7720).  During 
Che  period,  an  order  was  placed  with  HEC  Electronics,  U.S.A. ,  for  150 
factory-progrsHsed  SOM  versions  of  each  of  Che  three  SPI  single-chip 
aicrocoaputers  used  in  Che  single-card  LPC  vocoder.  The  order  has  been 
received  in  full  and  cheeks  on  approxiaacely  one-third  of  the  450  pieces  have 
found  the*  to  be  functional.  A  total  of  14  PVT-cowpatible  LPC  boards  have 
been  fabricated  and  debugged,  10  of  which  are  presently  operational  in  Che 
field.  The  boards  are  populated  with  the  factory-prograswttd  RM  SPIs.  Four 
are  in  use  in  PVTs  at  Information  Sciences  Institute  (ISI),  SRI 
International,  and  Defense  Communications  Engineering  Center  (DCEC).  Six  are 
in  use  in  PVTs  at  Lincoln. 

Two  of  the  LPC  vocoders  at  Lincoln  and  two  vocoders  at  ISI  and  SRI  were 
successfully  used  in  a  four-party  conference  over  Che  wideband  network  in  a 
rehearsal  for  the  3  June  packet  speech  demonstration.  During  the  actual 
demonstration  on  3  June,  the  SRI  site  used  the  CHI-V  LPC  implementation 
thereby  demonstrating  Che  co^atibility  of  the  Lincoln  and  Cul ler-Harrison, 
Inc.  (CHI)  LPC  vocoders. 

The  PVT  telephone  handsets  use  a  passive  microphone  element  produced  by 
Telephonies.  This  element,  which  performs  adequately  for  PCM  and  enbedded 
evSD  (ECVSD)  voice  digitisation,  has  been  found  to  significantly  degrade  LPC 
vocoder  perforomnee.  Currently  a  sderophone  element  made  by  Roanwell,  idiich 
has  been  found  to  be  satisfactory  for  use  with  LPC,  is  being  used  to  replace 
Che  Telephonies  units  in  PVT  handsets  that  communicate  with  Che  LPC  vocoder. 
Unfortunately,  the  Roanwell  element  is  no  longer  comsmrcially  available  and 
Lincoln  has  only  a  small  number  in-house.  Recently,  ws  have  been  able  to 
purchase  samples  of  a  new  passive  microphone  element  produced  by  Roanwell. 
Informal  tests  using  this  element  indicate  satisfactory  performance  with  the 


LPC  vocoder.  If  eh*  •i*MOC  pM««*  «or*  ritorou*  eest*  it  will  be  used  es 
Che  new  stsnderd  PVT  aicrophone  repleeint  the  Tele^onics  end  older 
RoeaiMll  units. 

Oeteiled  herArer*  end  softwer*  desiens  here  been  prepered  for  e 
generelised  stend-alon*  version  of  the  cciapeet  LPC  to  include  self-contained 
audio  and  an  R8-232  interface  for  connection  to  host  computers.  These 
designs  are  based  partially  on  colliAoraciv*  effort  with  ISl,  particularly  on 
Che  specification  of  Che  protocol  on  Che  serial  18-232  link.  In  addition, 
coaaents  received  froa  other  airtiirs  of  the  DABPA  pecket  speech  cowainity  on 
an  earlier  functional  design  deenment  have  been  factored  into  the  detailed 
design.  Hs  sipecc  the  initial  hardware  and  software  development  and  checkout 
CO  be  completed  during  the  first  quarter  of  FT  83. 


II.  COMPACT  EMBEDDED  VOCODER 


A  compact  multirate  LPC  vocoder  ayatem  haa  been  developed  which  (a) 
operatea  aa  an  embedded  apeech  coder  in  the  800-  to  4800-bpa  ranges  and  (b) 
can  be  implemented  in  hardware  uaing,  without  change,  the  NEC  SPl  analyaer, 
ayntheaizer,  and  pitch  detector  chipa  developed  for  the  cooqtact  LPC 
(diacuaaed  in  Sec.  I).  The  mechaniam  by  which  embedded  coding  haa  been 
effected  haa  been  to  develop  an  interface  between  the  LPC  algorithm  and  the 
external  comaunication  ayatem,  of  which  the  LPC  analyaer  and  ayntheaiaer  are 
totally  ignorant.  The  interface  ia  therefore  not  reatricted  to  the  NEC  SPl- 
baaed  LPC  realization,  but  could  operate  with  any  equivalent  LPC  realization. 
Both  five-rate  and  three-rate  ayateam  have  been  developed.  The  vocodera  were 
firat  iaq>lemented  aa  real-time  ayatema  on  the  Lincoln  Digital  Signal 
Proceaaor  (LDSP).  Architecturea  were  developed  for  hardware  realizationa 
uaing  the  NEC  SPla  and  an  INTEL  8085-baaed  microconq>uter  for  the  interface 
between  the  vocoder  chipa  and  the  PVT.  Microcode  waa  developed  for  the  8085, 
and  it  waa  ahown  that  a  aingle  8085  would  aupport  either  the  five-  or  three- 
rate  ayatem  in  real  tiaw. 

The  five-rate  ayatem  will  be  deacribed  firat,  along  with  the  baaic 
analyzer  and  ayntheaizer  interfacea.  The  ayatem  operatea  at  the  five  ratea 
of  889,  1244,  2489,  3556,  and  4622  bpa.  Thia  particular  act  of  bit  ratea  ia 
explained  by  the  fact  that  the  ead>edded  vocoder  ia  deaigned  to  communicate 
with  a  packet  network  in  which  parcela  muat  be  multiplea  of  8-blt  bytea. 

Thia  deaign  aaaumea  an  analyzer  which  updatea  LPC  parametera  every  11.25  ma. 
Aa  noted  above,  the  mechaniam  by  which  eiri>edded  coding  ia  effected 
eaaentially  involvea  an  interface  between  the  LPC  algorithm  and  the  external 

world  of  which  the  LPC  analyser  and  ayntheaizer  are  totally  ignorant.  The 

different  ratea  are  obtained  by  application  and  extenaion  of  frame-fill 
1  2 

concepta  *  to  conform  with  the  eonatrainta  of  embedded  coding. 

A  dcacription  of  the  analyser  interface  follows.  Aa  ahown  in  Fig.  1, 
raw  parametera  received  from  the  LPC  analyser  are  coded  and  buffered  until 
four  sets  of  parameters  (A  to  D)  representing  45  ms  have  been  accumulated. 


(The  pitch  period  ia  coded  logerichaiically  as  in  the  DoD  standard 
3 

algorithm,  and  the  energy  and  K-parameters  are  coded  using  tables  from  the 
Lincoln  autocorrelation  LPC  (Ref.  4),  which  has  been  utilized  in  most  of  the 
DARPA  packet  speech  experiments.  Using  the  four  sets  of  coded  parameters, 
there  are  now  three  tasks  to  be  performed.  First,  sets  of  truncated  pointers 
from  frames  A,  C,  and  E  are  used  to  decide  how  best  to  reconstruct  the 
parameters  of  frame  C  in  the  event  that  the  synthesizer  interface  receives 
only  priority  I  information.  The  priority  I  parcel  is  then  formed  which 
contains  the  minimum  information  needed,  as  seen  in  Table  I.  Second,  the 
sets  of  full  pointers  from  frames  A,  C,  and  E  are  used  to  make  the  frame-fill 
decision  for  frame  C.  This  information  plus  the  LSBs  of  the  truncated 
pointers  become  the  priority  II  parcel  (see  Table  II).  Third,  the  sets  of 
pointers  from  frames  C,  D,  and  E  are  used  to  determine  the  frame-fill 
strategy  for  frame  D  in  the  event  that  the  parameters  of  frame  C  reach  the 
synthesizer  interface.  The  contents  of  the  priority  III  parcel  are  shown  in 
Table  III.  (Note  that  the  LSBs  of  the  pitch  period  pointers  for  frames  B  and 
D  are  included.)  Priority  IV  and  V  parcels  containing  the  parameters  for 
frames  B  and  D  (shown  in  Tables,  IV  and  V)  are  then  formed.  All  parcels  are 
now  ready  for  transmission  to  the  protocol  processor. 

The  synthesizer  interface  receives  from  the  protocol  processor  either 
all  (I  through  V)  or  a  stripped  subset  (I  through  ?)  of  the  transmitted 
parcels  and  the  indication  of  which  subset  has  survived  transmission 
(V,IV, . , .  ,1) ,  If  only  priority  I  has  survived,  reconstruction  of  frames  B, 

C,  and  D  is  as  follows  (refer  to  Fig.  2).  The  coded  parameters  for  frame  C 
are  formed  either  by  copying  from  frame  A  or  E  or  interpolating  between  them 
as  dictated  by  the  fram-fill  strategy  included  in  the  priority  I 
information.  The  pitch  pointers  of  A  and  C  are  copied  into  B  and  D, 
respectively. 

The  energy  and  K  pointers  of  B  and  D  are  formed  by  interpolating  the 
adjacent  frames  as  shown.  The  four  frames  of  paraawters  are  then  decoded 
using  a  special  set  of  decoding  tables  with  coarser  quantisation. 
Reconstruction  of  the  four  frames  is  the  same  if  priority  II  information  has 


Tiin<B  I 


PRIORITY  I  PARCEL  (5  Bytes) 


TABLE  II 


PRIORITY  II  PARCEL  (2  Bytes) 


K5  (A) 
1  L8B 


KA  (A) 
1  L8B 


PraM-Pill 
Strategy  (C) 


E2  (A) 

2  LSBs 

El  (A) 

2  LSBs 

ElO  (A) 

1  LSB 

K9  (A) 

1  LSB 

E8  (A) 

1  LSB 

E7  (A) 

1  LSB 

E  (A) 
1  L8B 


TABLE  V 

PRIORITY  V  PARCEL  (6  Bytea) 

- E  (D) 

Pitch  (D) 

5  MSBa 

K1  (D) 

K2  (D) 

- R5  (D) 

K4  (D) 

„ - 

- r?  (D) 

K6  (D) 

- - 

KIO  (0) 

K9  (D) 

K8  (D) 

- - 

b««a  received  except  that  the  final  decoding  of  the  paraaetera  ia  done  uaing 
the  regular  aet  of  decoding  tablea.  Figure  3  ahowa  fraae  recona true t ion  when 
priority  III  paraaetera  are  received.  Fraae  0  ia  foraed  aa  dictated  by  the 
ftaae-fill  atrategy  for  0,  and  fraae  B  ia  conatrueted  aa  deacribed  above.  If 
priority  IV  inforaation  haa  been  received,  only  fraae  D  haa  to  be  f^.lled 
before  decoding  the  paraaetera,  aa  ahown  in  Fig.  4.  Priority  V  inforaation 
neceaaitatea  only  decoding  the  paroMtera.  Decoded  aet a  of  paraaetera  are 
then  buffered  and  aent  to  the  ayntheaiser  every  11.25  aa.  The  decoding 
tablea  for  the  energy  and  K-paraaetera  are  double  length  to  accoaaodate  the 
interpolation  of  pointera. 

Thia  five-rate  eabedded  LPC  vocoder  waa  firat  iapleaented  aa  a  real-tiae 
ayatea  on  the  LDSF.  Oiagnoatic  Bhyae  Teat  (DRT)  acorea  of  vocoder 
perforaance  were  obtained  for  the  following  ratea: 

Bate  (bpa)  DBT  Score 


889 

84.4 

1244 

87.2 

2489 

90.6 

4622 

91.1 

Scorea  were  averaged  over  three  apeakera. 


Our  ultiMte  goal  vaa  to  aaaaaa  the  faasibility  of  using  an  IMTEL  8085- 
basad  8-bit  aiicrocoaputer  to  support  the  interface  between  the  LPC  vocoder  in 
the  MEC  chips  and  the  Packet  Voice  Tensinal.  Figure  5  depicts  the  suggested 
architecture  for  a  five-rate  e^edded  LPOIO.  Notice  that  an  additional 
analyser  chip  would  be  needed  for  the  computational  capacity  to  provide  a  new 
set  of  reflection  coefficients  every  11.25  ms.  The  vocoder  interface  was 
completely  programswd  for  the  8085  and  verified  with  canned  data  on  the 
emulator  of  the  Hewlett-Packard  development  system.  Despite  initial  fears 
that  a  single  8085  would  not  suffice,  it  appears  that  if  a  200-ns  clock  were 
used  real  time  would  not  be  exceeded.  Memory  requiresMnts  for  this  interface 
are: 

ROM  «  6K 
RAM  -  IK 

The  five-rate  embedded  LPC-10  interface  was  then  stripped  to  represent 
only  three  rates:  889,  1244,  and  2311  bps.  This  interface  does  not  require 
an  additional  NEC  analyser  chip  because  frames  are  22.5  ms  in  length  instead 
of  11.25  SM.  The  contents  of  the  three  priority  parcels  may  be  seen  in 
Tables  VI  through  VIII.  Memory  requirements  are: 

RON  -  4K 
RAM  -  IK 

Both  the  five-rate  and  three-rate  LPC  analyser  interfaces  include 
silence  detection.  If  the  residual  energy  has  not  exceeded  a  threshold 
during  the  last  300  ms,  this  information  is  conveyed  to  the  protocol 
processor  via  the  8-bit  header.  The  energy  threshold  is  adaptive  in  order  to 
accoHBodate  differences  in  handsets  and/or  ambient  noise.  The  algorithm  for 
adaptation  is  deliberately  simplistic.  Every  N  seconds  a  new  minimum  energy 
is  established.  All  frames  idiich  are  either  voiced  or  exceed  the  energy 
threshold  by  M  decibels  are  declared  speech.  N  has  been  arbitrarily  set  to 
2  s,  and  M  is  equal  to  4.5  dB.  Since  the  quantised  energies  of  the  coding 
table  are  spaced  a  fixed  decibel  apart,  the  algorithm  deals  solely  with 
pointers  to  the  coded  values,  eliminating  any  need  for  16-bit  arithmetic. 
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4  SETS  OF  COOED  LPC  PARAMETERS 
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4S  ms 


LATEST 

COOED 

PARAMETER 

SET 


PREVIOUS 

A 

PARAMETERS 


_ i _ I 

FRAME-FILL  DECISION  FOR  FRAME  C 
USING  TRUNCATED  POINTERS 

_ 1 _ 1 

FRAME-FILL  DECISION  FOR  FRAME  C 
USING  FULL  POINTERS 

I _ I  I 
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Fit*  S.  Cabcddcd  LPC  architecture. 
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III.  PACKET  VOICE  TESMIllAL 


A.  PVT  AND  LEXNET  HASDHABE  STATUS 

Two  additional  LEXNET  Coacantrator  Intarface  (LCI)  units  have  been 
constructed  to  allow  the  installation  of  networks  at  new  sites  while 
■aintaining  the  capability  of  perforaing  two-network  tests  at  Lincoln. 

LEXNETs  have  been  installed  at  SRI  and  DCEC,  each  consisting  of  an  LCI 
and  one  PVT.  They  were  initially  installed  to  support  the  3  June 
deawnstration,  but  now  reauiin  in  place  for  use  in  general  testing  of  the 
satellite  channel.  Another  single-tensinal  LEXNET  has  been  returned  to  Bolt, 
Beranek  and  Newaan  (BBN)  for  use  in  testing  of  the  Voice  Funnel. 

Several  hardware  upgrades  have  been  installed  in  the  LEXNET  equipaent 
over  the  last  quarter.  The  aost  significant  of  these  was  a  change  to 
increase  the  data  rate  on  the  channel  froa  the  LCI  to  the  concentrator  froa 
375  to  600  kbps.  This  change  is  possible  because  of  ii^roved  software  for 
handling  the  810  chip  on  the  UHC-Z80.  Further  experiaents  were  conducted  to 
see  if  this  rate  could  be  increased  further.  It  was  found  that  this  link 
would  start  to  fail  at  about  625  kbps,  but  would  work  reliably  at  lower 
rates;  the  current  LCI  cards  are  set  to  run  at  600  kbps.  The  reaaining 
changes  involved  ainor  wiring  additions  to  allow  the  installation  of  the 
tiaer  card  in  the  PVT. 

B.  PVT  TECHNOLOGY  TRANSFER 

A  draft  specification  for  the  PVT  has  been  written  and  is  undergoing 
internal  review.  The  specification  asks  for  terainals  which  arc  electrically 
identical  to  the  current  terainals.  lie  ere  requiring  that  cards  be  usable  in 
the  current  terainals  for  testing  and  spares,  but  we  give  the  cont rector  the 
option  of  translating  the  design  to  a  printed  circuit  card.  The  Request  for 
Quote  (RFQ)  will  ssk  for  prices  st  quantities  of  10,  20,  and  50  terainals. 


C.  PVT  SOFTWAKE  STATUS 


The  PVT  conferencint  feature  Ic  aow  operationel.  The  PVTa  handle  three 
vocoders:  PCM,  2400-bps  LPC  (MEG  SPl-based  vocoder) ,  and  eabedded 
Continuously  Variable  Slope  Delta  Modulation  (ECVSD).  The  ECVSD  can  run  at 
rates  at  14,  32,  48,  and  64  kbps.  The  rate  can  be  changed  at  any  tiae,  even 
in  aid-talk  spurt. 

A  distributed  "floor  controller"  is  operational.  The  algoritha  is 
included  in  the  PVT  software  and  resolves  conflicts  when  aore  than  one 
participant  tries  to  talk  at  the  saac  tiae.  The  PVT  will  not  transait  speech 
if  its  user  does  not  "have  the  floor."  Conflicts  are  currently  resolved  on  a 
siaple  priority  basis  but  the  floor  controller  is  designed  so  that  other 
algorithas  aay  be  easily  substituted. 

A  tiae  out  and  retranaaission  BMchaaisa,  for  enhanced  robustness  in  call 
setup,  has  been  incorporated  into  the  PVT  software.  The  PVT  monitors  each 
control  aessage  sent  and  retranaaits  a  aessage  when  the  appropriate  response 
is  not  received.  After  ten  consecutive  failures  the  PVT  will  autoaatically 
close  down  the  connection.  This  feature  should  ensure  that  conference  and 
point-to-point  (PTP)  connections  are  reliably  established  even  during  tiaas 
of  hi^  packet  loss.  This  algoritha  has  been  debugged  locally  between  PVTs 
and  between  PVTs  and  the  Access  Controller.  It  aust  still  be  tested  «.yer  the 
satellite. 

The  Access  Controller  (AC)  has  been  operating  successfully  for  a  variety 
of  conference  scenarios.  As  a  system  resource,  the  AC  software  is  running  in 
a  PVT  at  a  "well  known  address"  on  a  LEEMET  at  Lincoln.  Currently,  the  AC 
knows  about  three  possible  conferences  (one  for  each  possible  vocoder  type). 
It  currently  accepts  all  callers  who  know  the  conference  naae  and  password 
and  idio  can  use  the  correct  vocoder.  Other  types  of  conferences  aay  be  added 
easily. 

The  AC  has  the  ability  to  operate  with  the  new  PVT  foftware.  It 
facilitataa  the  retransaiasion  aechanisa  by  acknowledging  all  received 
■assagas  and  it  haodlas  tha  duplieata  aaasagas  it  occasionally  recaivas 
bacausa  of  ratransaiasions. 
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Coaferencc*  aay  b«  act  up  ia  thru#  ways.  All  the  participaats  caa  dial 
the  AC  aad  ask  to  joia  the  coafereaee  (referred  to  as  "Meet  Me"),  a 
coafereace  origiaator  caa  iavlte  ia  others  that  he  wishes  to  have  ia  his 
coafereaee,  or  the  coafereace  origiaator  caa  dial  the  Voice-Coatrolled 
Operator  (VCOP)  aad  request  that  a  coafereaee  be  set  up. 

Ia  the  "Meet  Me"  style  of  eoafereoeiog  each  participaat  dials  the  AC. 

The  AC  provides  iaforaatioa  about  the  others  ia  the  coafereace  aad  PVT 
software  executes  the  aecessary  protocol  steps  to  set  up  coaaectioas  to  all 
the  other  coaferees.  This  was  the  first  SKthod  of  eoafereoeiog  iwpleweoted 
aad  was  deaoostrated  at  the  3  Juoe  packet  speech  aeetiog. 

A  oew  dial  up  aethod  of  startiog  a  coafereace  is  oow  workiog.  The 
coafereace  origiaator  after  eateriog  the  coafereaee  hiwself,  dials  ia  turo 
the  PVT  addresses  of  the  other  participaats.  His  PVT  seeds  a  special  "Please 
Joia  a  Coafereaee"  to  the  other  PVlts.  A  PVT  reeeiviog  this  aessage  riogs  its 
phooe.  Hheo  the  phooe  is  aoswered  the  PVT  ioitiates  the  protocol  exchaoge 
with  the  AC  which  lets  the  iadividual  joia  the  coafereace. 

Based  oo  the  code  writtea  to  iaplesaot  "Please  Joia  a  Coafereace,"  the 
first  stage  of  the  PVT  software  aecessary  to  support  the  Voice-Coat rolled 
Operator  (VCOP)  is  writtea  aad  largely  debugged.  A  coafereace  origiaator 
calls  the  VCOP  PVT.  Nhea  the  protocol  exchaoge  is  fioished,  VCOP  asks  the 
origiaator  a  series  of  quest ioos  to  obtaio  aecessary  coafereace  paraaeters. 
VCOP  theo  requests  the  origiaator  to  haag  up  his  ^ooe.  VCOP  passes  to  its 
PVT  the  paraaeters  it  has  received  froa  the  origiaator.  Its  PVT  theo  issues 
"Please  Joia  a  Coafereace"  aessages  to  the  participaats*  PVTs.  Sectioo  VI 
discusses  VCOP  status  ia  aore  detail. 

Other  features  have  beeo  added  to  aake  the  PVTs  aore  robust.  Closiog 
dowo  a  coafereace  caa  take  aore  thaa  20  a  if  the  PVT  caooot  get  its 
discoaoect  asssage  through  to  soae  other  participaat  (the  discooaect  is 
retraomitted  tea  tiaas  at  2*s  iotervala).  If  a  caller  picks  up  the  j^ooe 
duriog  this  period  he  will  hear  a  feat  busy  sigoal  aad  his  dialiog  will  be 
igaored.  As  sooo  as  the  shutdowa  protocol  has  coapleted,  the  caller  will  get 
a  dial  tooe  aad  caa  pla^e  his  call. 
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Software  in  both  the  PVTs  ead  the  AC  hee  been  upgraded  to  correctly 
handle  the  case  where  a  PVT  crashes  and  then  tries  to  reenter  the  conference. 
A  PVT  can  hang  up  and  later  rejoin  a  conference  at  will.  During  a  conference 
which  has  been  set  up  in  any  of  the  three  ways  listed  above,  any  participant 
can  "Ask  In"  another  PVT  by  dialing  its  PVT  address. 

D.  MBASOKBNBirr  HOST  DBVBLOPMBIIT 

A  aeasurewent  host  capability  has  been  added  to  the  PVT,  in  order  to 
augment  our  facilities  for  carrying  out  performance  weasuresttnts  on  the 
wideband  network.  The  s»st  important  difference  between  this  measurement 
host  and  previous  PVT-based  traffic  emulators  is  the  capability  for  precise 
timing  smasurements  on  cross-net  packet  tranmissions.  This  capability  is 
provided  by  means  of  a  new  timer  card  which  plugs  into  the  vocoder  slot  on 
the  PVT.  The  timer  card,  in  turn,  connects  to  a  precise  global  tiung  clock 
which  has  been  provided  to  us  by  X81.  The  clock  obtains  precise  global  time 
from  a  radio  signal  (HHVB)  maintained  by  the  Rational  Bureau  of  Standards. 
Similar  WiVB  clocks  are  installed  at  the  ISl  site.  In  addition  to  the  timer 
card,  the  measurement  host  also  includes  new  software  in  the  protocol 
processor  which  provides  a  flexible  capability  for  packet  generation,  delay 
histogram  collection,  and  configuration  of  experiments. 

The  user  can  configure  experiments  based  on  the  following  set  of 
capabilities: 

(1)  Traffic  Models 

Deterministic  -  The  host  generates  fixed-length 
packets  of  L  bytes  at  a  steady  rate  It  per  second.  R 
and  L  are  selectable  at  set-up  tiaw. 

Poisson  -  Same  as  deterministic  except  R  is  now 
taken  to  be  a  mean  rate  in  essilating  a  Poisson 
process. 

Talker  Activity  Model  -  The  host  emulates  a 
statistical  model  of  talker  activity  for  each  of  R 
talkers.  Every  J  frames  (1  frame  ■  20  ms)  it 
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examines  the  activity  of  each  simulated  talker  over 
the  previous  J  frames  and  sends  a  packet  of  L  bytes 
if  activity  occurred.  The  user  can  select  N,  J,  and 
L,  and  certain  parasmters  of  the  activity  aM>del. 

(2)  Packet  Protocol 

Either  the  Internet  datagram  Protocol  (IP)  or  Stream 
(ST)  Protocol  can  be  selected. 

(3)  Output 

The  receiving  host  keeps  a  histogram  of  the  absolute 
delay  or  delay  dispersion  depending  on  whether  both 
hosts  have  access  to  a  coonon  clock  (see  below).  At 
set-up  time  the  user  can  select  the  histogram  bin 
sise. 

A  typical  experimental  configuration  la  illustrated  in  Fig.  6.  The 
entire  experiment  can  be  controlled  from  one  of  the  hosts.  In  Fig.  6  the 
user  communicates  directly  with  A  via  terminal  and  can  ask  A  to  send  control 
messages  to  B  as  well. 

The  label  "HWVB”  refers  to  the  HWVB  radio  signal  (maintained  by  the  NBS) 
which  can  be  used  to  synchronise  clocks.  The  swasurement  host  is  designed  to 
run  with  the  SPECTRACOM  Model  8170  WNFB  synchronised  clock.  If  these  clocks 
are  available  to  both  hosts,  one  obtains  one-way,  absolute  delay 
measurements.  Otherwise,  one  or  both  hosts  must  use  an  internal  clock,  and 
they  measure  one-way  delay  dispersion  and  round-trip  absolute  delay. 

The  procedure  in  a  typical  experiment  is  as  follows. 

(1)  The  user  at  A  tells  B  to  be  a  receiver  for  packets 
from  A.  At  this  point,  B  seroes  its  counts  and 
waits  for  packets  from  A. 

(2)  The  user  tells  A  to  transmit  to  B,  selecting  one  of 
the  options  described  before.  A  then  transmits  to 
B,  selecting  one  of  the  options  described  before.  A 
then  transmits  time-stamped  packets  to  B. 

(3)  The  user  tells  A  to  stop  transmitting. 
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(4)  1h«  u««r  then  retrieves  results.  Host  B  is  told  to 

send  its  results  froa  display  oa  A's  terainal.  The 
histogrsa  appears  in  tabular  fora.  Host  A  can  be 
asked  to  display  the  nu^er  of  packets  sent  for 
coaparison. 

The  tiaing  and  traffic  esulation  functions  of  the  aeasureaent  host  are 
perforaed  by  a  aicroprocessor-based  tisdng  card  which  occupies  the  speech 
processor  slot  of  the  PTT.  the  aain  swasuresant  host  prograa,  tdiich 
interacts  with  the  user  and  constructs  the  needed  packets  and  headers, 
replaces  the  MVP  processor  progrsa  of  sn  ordinary  PVT. 

The  hardware  is  coaplete  and  preliainary  ezperiaents  have  been  run  using 
IP  packets.  Hork  continues  on  adding  an  ST  packet  capability  to  the  card. 
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Pig.  6.  Nsasureasnt  host  experiasnt  configuration. 
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IV.  MINICONCENTRATOR 


A.  HARDWARE  STATUS 

Throughout  this  seaiannuai  reporCiag  period  aiatconceacrator  gatevaya 
have  been  running  at  ISI,  SRI,  DCEC,  and  Lincoln  Laboratory.  The  gateway!  at 
SRI  and  DCEC  trere  augmented  with  hardware  to  aupport  LEXNETs  so  that  a  richer 
packet  voice  environawnt  would  be  available  for  demonstrations  and 
experiments.  All  the  gateways  now  in  service  have  three  UMC-Z80  interface 
processors  connected  to  a  central  PDP-Il/44  miniconcentrator. 

Our  experience  with  the  reliability  of  the  miniconcentratoi  hardware  has 
been  very  encouraging.  In  the  past  six  aK>nths  we  have  experienced  no  hard 
failures  in  the  eight  PDP-11/44  machines  with  which  we  have  been  working.  We 
have  experienced  an  intermittent  problem  with  one  of  the  machines  that  has 
been  in  use  in  our  laboratory  as  a  checkout  facility.  Over  the  aasw  period 
we  have  experienced  four  hard  failures  in  35  UMC-Z80  processor  and  memory 
boards  and  a  similar  number  of  intermittent  problems.  One  of  these  hard 
failures  caused  an  outage  oi  several  days  for  one  of  the  gateway  machines, 
but  otherwise  there  were  no  gateway  hardware  malfunctions  that  caused 
interruptions  in  network  experiments. 

B.  GATEWAY  SOFTWARE 

1.  Gateway  Program 

Several  major  extensions  to  the  PDP-11  gateway  program  were  achieved 
during  this  time  period.  These  extensions  are  in  the  areas  of  speech 
conferencing,  robustness,  measurcswnt  and  instrumentation  experiments,  and 
more  extensive  external  control  of  the  gateway  program. 

Foremost  mnong  the  extensions  was  multisite  speech  conferencing  between 
participants  on  local  networks  communicating  via  gateways  and  the  PSAT.  In 
order  to  establish  a  conference  the  gateways  use  ST  protocol  to  communicate 
with  each  other  the  control  messages  chat  were  originated  by  LEXNET  PVTs  and 


PRNET  SIUs.  Then  the  "bit  ■epe"  in  the  speech  messages  are  used  to  control 
the  forwarding  of  speech  packets  generated  by  the  participants.  This 
capability  culminated  in  a  demonstration  at  the  packet  speech  meeting  at 
Lincoln  on  3  June  in  which  the  participants  in  the  conference  were  at  a  PVT 
on  a  LEXNBT  at  Lincoln,  at  a  PVT  on  a  LEXHET  at  ISl,  and  at  an  SlU  on  a  PRNET 
at  SRI.  The  gateways  at  each  site  broadcast  the  LPC  speech  to  each  other 
using  stream  allocations  and  group  addressing. 

Several  prior  achieveownts  were  instrumental  in  the  success  of  the 
demonstration.  An  ad  hoc  protocol  was  developed  by  means  of  %ihich  each 
gateway  communicates  with  the  others  both  to  tell  them  that  it  is  on  the  air 
and  to  request  them  to  join  a  PSAT  group  that  it  has  created.  (PSAT  groups 
are  a  mechanism  provided  by  PSATs  in  which  several  hosts  who  belong  to  the 
group  receive  a  message  sent  to  the  group  address;  these  groups  are  then  used 
as  the  broadcast  addresses  for  the  transmission  of  the  speech  from  one 
participant  to  all  the  others.)  Additionally,  a  network  module  for  the 
gateway  to  handle  PRNET  communication  was  generated  and  several  gateways  were 
installed  on  the  SRI  POP-11/44  computer  to  handle  various  combinatioqs  of 
PSAT,  LEXNET,  and  PRNET  networks. 

To  provide  greater  robustness  in  establishing  connections  and 
conferences  a  tisw-out/retransmisaion  mechanism  has  been  developed  in  the 
gateway.  Using  a  state  table  the  gateway  retransmits  ST  control  messages  a 
number  of  times  until  they  are  correctly  acknowledged  or  until  a  fixed  number 
of  transmissions  have  been  performed.  If  success  is  not  achieved  then  the 
gateway  proceeds  to  recover  from  the  missing  control  messsges. 

In  order  to  assist  in  the  carrying  out  of  measurement  and 
instruswntation  experiments  the  handling  of  new-style  IP  source-routing  (a 
revised  standard  in  the  DARPA  Internet  community)  was  incorporated  in  the 
gateway,  replacing  an  older  obsolete  source-route  option.  The  new  source¬ 
routing  is  useful  for  experimentation  with  and  measurement  of  delays  involved 
in  using  different  network  paths,  and  in  particular  allows  us  to  source-route 
through  IP  gateways  (developed  by  BM  and  others)  located  elsewhere  in  the 
DARPA  Internet.  Additionally,  the  gateway  now  monitors  its  internal  meswry 
resources  in  histograms  during  its  operation.  This  enables  one  to  determine 
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the  extent  of  epare  capacity  for  varioua  experiaenta  with  the  goal  of 
balancing  reaourcea. 

A  protocol  was  developed  and  iagtleflanted  between  the  gateway  prograa 
running  in  the  PDP-’ll  and  the  network  input-output  prograas  running  in  the 
UMC  interface  processors.  This  protocol,  together  with  aatching  keyboard 
gateway  conaMnds,  peraits  one  to  exaaine  and  aodify  DMC  aeaory  while  the 
prograa  is  running.  This  provides  for  convenient  access  to  counts  that  the 
UMC  prograas  aaintain  during  their  operation.  Previously,  the  UMC  prograas 
had  to  be  halted  in  order  to  perfora  such  operations. 

In  certain  error  cases  the  gateway  now  generates  Internet  Control 
Message  Protocol  error  aessages  to  the  source  of  the  incorrect  sassage. 

The  Exploratory  Data  Metwork  (EDM)  was  added  to  the  list  of  networks 
that  the  gateway  supports. 

For  external  gateway  control  a  ntnber  of  new  coaaands  were  added. 
Included  are  coaaands  to  specify  the  PSATs  that  are  to  be  involved  in  group 
addresses,  to  create  and  delete  streasa,  to  clear  out  certain  infonsation 
that  the  gateway  is  aaintaining,  and  to  dynaaically  control  the  gateway  aode 
hits.  For  logging  and  snnitoring  purposes  the  date-tisK  is  supplied  in 
critical  places  in  terainal  output  produced  by  the  gateway.  Finally,  for 
aaintenance  purposes  each  gateway  has  a  version  niasber  and  unique  serial 
nuaber  associated  with  it. 

2.  Support  Software 

The  downline  comaunication  prograa  TOEPOS  was  installed  on  2  host 
coaputers  at  SRI  running  under  V7  UNIX  and  coaaunicating  with  the  object 
coaputer  via  a  MICOM  terainal  aailtiplexer.  This  aarked  the  fourth  distinct 
operating  environaant  in  which  the  prograa  has  been  installed. 

TOEPOS  was  extended  to  accoModate  downline  coaputers  which  do  not  run 
the  EPOS  operating  systea.  If  such  use  is  specified  when  TOEPOS  is  invoked, 
then  the  functions  relating  to  aultiplex  TTY  line  selection  are  auppressed, 
yielding  type-through  coaaunication  free  of  TTY  line  identifications.  Using 
this  awde,  TOEPOS  has  subsuaed  the  functions  of  the  outdated  TOLEXNET  prograa 


for  coanunicating  with  and  downloading  PVTa.  Additionally,  TOEPOS  is  now 
being  used  for  running  PVT  Heasureaent  Host  experiaents  and  controlling  the 
generation  of  subsequent  plots  of  the  results. 

A  coaaand  was  added  to  TOEPOS  by  aeans  of  which  a  user  can  request 
TOEPOS  to  supply  its  terainal  output  to  an  additional  user-specified  terainal 
line.  Such  output  is  in  addition  to  that  supplied  to  the  terainal  on  idiich 
TOEPOS  is  running  and  to  the  script  file.  This  provides  a  "watch"  capability 
in  which  a  remote  site  can  watch  the  actions  at  the  terainal  running  TOEPOS. 
This  capability  was  used  in  the  3  June  deaonstration  to  provide  instantaneous 
awareness  of  the  state  of  the  gateways  to  observers  at  Lincoln  and  the  site 
at  which  the  gateway  was  running. 


V.  WIDEBAND  NETWOBX  EXPERIMENTS  AND  EXPERIMENT  COORDINATION 


The  major  emphaaia  in  esperiawat  planning  and  coordination  in  the  April- 
June  time  frame  waa  preparation  for  a  wideband  packet  apeech  demonatration  to 
be  held  at  Lincoln  on  3  June.  The  general  goala  of  the  demonatration  bfd 
been  defined  at  the  wideband  aweting  at  DARPA  in  March,  accompanied  by 
extenaive  diacuaaion  of  the  taaka  and  integration  activitiea  that  would  have 
to  be  completed  throughout  the  network  in  order  to  achieve  theae  goala. 

DARPA  inaugurated  a  ayatem  of  weekly  reporting  of  the  atatua  of  all  of  theae 
activitiea,  with  Lincoln  acting  aa  the  coordination  point  for  all  the  aitea. 
The  weekly  reporta  contained  atatementa  of  the  goala  that  had  to  be  achieved 
with  reapect  to  WB  SATNET  operation  and  integration  of  varioua  equipment  at 
the  aitea,  and  included  aummariea  of  all  intermediate  mileatonea  achieved  to 
date  in  each  area. 

The  packet  apeech  demonatration  itaelf,  which  waa  conducted  during  the  3 
June  DARPA  packet  apeech  meeting  at  Lincoln,  waa  an  excellent  aucceaa.  A 
broad  aet  of  capabilitiea  for  packet  apeech  communication  over  WB  SATNET, 
with  internetwork  connectiona  to  LEXNETa  and  a  packet  radio  net  (PRNET),  waa 
ahown  by  amana  of  a  aequence  of  calla  among  the  Lincoln  (LL),  ISl,  and  SRI 
aitea.  The  demonatration  followed  a  period  of  intenae  effort  and  cooperation 
among  all  the  participanta  in  the  Wideband  Program,  which  culminated  in  a 
acrica  of  dry  runa  which  were  very  uaeful  in  final  integration  of  all  the 
aubayatema  involved. 

The  equipment  configurationa  that  were  available  at  the  four  network 
aitea  at  the  time  of  the  demonatration  are  aumiMrised  in  Fig.  7.  The 
aucceaaion  of  demonatration  calla,  to  be  deacribed  below,  will  be  deacribed 
in  the  context  of  Fig.  7. 

Aa  indicated,  each  of  the  four  aitea  had  ita  earth  atation,  SSI,  PSAT, 
and  varioua  hoata.  SRI  had  a  miaieoncentrator  gateway;  a  LSXNST  with  one 
Packet  ?oiee  Terminal  (PFT)  equipped  with  a  Lincoln  aingle-card  2.4-kbpa  LPC 
vocoder  in  addition  to  the  built-in  SA-kbpa  mu-law  P(Dt  coder;  and  a  Packet 
Radio  let  with  two  FRUIT  packet  voice  tetmiaala,  each  conaiating  of  an 
LSl-li/23  Speech  Interface  Onit  (SlU)  and  a  CHl-V  vocoder  running  the 
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2.4-kbpt  LPC-IO  algorithm.  One  of  these  voice  terminals  was  mounted  in  a 
van,  and  participated  in  the  mobile  PRNET  packet  voice  demonstration  below. 
ISl  had  a  Voice  Funnel  configured  as  a  gateway  (which  was  not  involved  in  the 
demonstration);  a  miniconcentrator;  a  LEXNET  with  two  PVTs;  and  a  PDP-11/45 
speech  host  with  packet  voice  and  video  capability  (which  were  also  not 
involved  in  the  demonstration).  One  PVT  was  equipped  with  an  ISl-built 
Switched  Telephone  Network  Interface  (STNl)  card  which  enables  a  64-kbps  PCM 
packet  voice  user  on  the  network  to  access  the  public  telephone  system,  as 
described  below,  and  the  other  was  equipped  with  the  single-card  LPC  vocoder. 
Lincoln  had  a  Funnel  (not  used  in  the  demonstration);  a  miniconcentrator  and 
LEXNET,  including  multiple  PVTs  and  the  Conferencing  Access  Controller 
described  below;  and  a  Packet/Circuit  Interface  (PCI)  and  Telephone  Office 
Emulator,  or  TOE  (developed  under  DCA  sponsorship,  and  not  involved  in  the 
demonstration).  Two  of  the  PVTs  were  equipped  with  16-  to  64-kbps  EBd>edded 
CVSD  (ECVSD)  vocoder  cards,  and  two  had  LPC  vocoder  cards.  The  DCEC  site, 
equipped  as  shown,  wav'  not  included  in  the  3  June  demonstration;  DCEC  was  the 
site  of  another  demonstration  in  early  October  focusing  on  the  PCIs.  and  TOBs. 

As  indicated  in  the  generic  packet  voice  call  demonstration  setup 
depicted  in  Fig.  8,  audio  pickups  were  attached  to  various  handsets  at 
Lincoln,  so  that  the  reconstituted  voice  signals  from  the  various 
participants  could  be  played  through  the  conference  room  audio  system  for  the 
benefit  of  the  audience.  Figure  8  also  illustrates  the  point  that  each  call 
required  two  classes  of  packets  to  be  transmitted  between  the  terminals: 

(a)  control  packets  to  set  up  and  take  down  the  call;  and  (b)  speech  packets 
to  transport  the  digitised  voice. 

The  demonstration  sequence  of  calls  which  were  completed  is  sunaMrised 
in  Table  IX.  The  demonstration  sequence  began  with  four  local  calls  on  the 
LEXNET  at  Lincoln,  as  follows:  a  point-to-point  call  using  64-kbps  PCM;  a 
comparison  of  voice  quality  at  the  various  rates  in  a  point-to-point  ECVSD 
call;  a  point-to-point  2.4-kbps  LPC  call;  and  a  4-party  conference  using  64- 
kbps  PCM.  Two  point-to-point  calls  over  the  satellite  net  were  then 

•  64-kbps  call  between  a  PVT  at  Lincoln  and  one  at  ISI,  and  a  64- 
kbps  call  from  Lincoln  to  the  local  Los  Angeles  weather  forecast  nuaiber  by 
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TABLE  IX 

SUMMABX  or  PACEET  »EBCR  DEHDHSTRATIOE  SEQUENCi:; 

3  JUNE  1982 

LOCAL  CALLS  ON  LINCOLN  LEXNET 
Point-to-Point 

1.  PCM  (64  kbps) 

2.  ECV8D  (16  to  64  kbps) 

3.  LPC  (2.4  kbps)  Using  Single-Csrd  LPC  in  PVT 
Confersncs 

4.  PCM  4-Psrty  Confersncs 
CALLS  OVER  WIDEBAND  SATNET 

Point-to-Point 

5.  PCN:LL  -  181 

6.  PCMtLL  -  Switched  Telephone  Network  Interfsce  at.  181 

(call  to  local  Los  Angeles  weather) 

Conference 

7.  3-Site,  4-Party  LPC  Conference 

LL  (2  parties  on  LEXNET) 

I8I  (on  LEXNET) 

SRI  (on  PRNET  unit  in  speech  lab,  using  CHI-V  LPC 
and  81U) 

Point-to-Point  (wobile) 

8.  LPC:SR1  (wobile  PMET  unit)  -  LL  (LEXNET  PVT) 

Nobile  Van  Run  Dewonstrating  PRNET  Speech 
and  Alternate  Routing 


way  of  the  STNI  card  at  ISI.  A  3-aite,  4-party  LPC  conference  over  the  WB 
SATNBT  waa  then  aet  up,  involving  two  PVTa  at  Lincoln;  a  PVT  at  ISI;  and  a 
PniET  packet  voice  terminal  in  the  apeech  laboratory  at  ISI.  Finally,  an 
extended  mobile  call  waa  aet  up  between  a  PVT  at  Lincoln  and  the  moving 
packet  radio-equipped  van  on  local  atreeta  and  the  Bayahore  Freeway  near  SRI, 
uaing  2.4-kbpa  LPC  via  a  PRNBT  packet  voice  terminal  in  the  van.  During  the 
latter  run,  aignala  from  the  van  were  aent  to  the  PRNBT  baae  atation  at  SRI 
via  aingle  and  awltiple-hop  routea,  aa  determined  by  obatructiona  in  the 
line-of-aight  patha,  and  were  relayed  from  the  baae  atation  to  Lincoln  via 
the  WB  SATNET. 

An  important  conaideration  throughout  the  preparationa  for  the  3  June 
demonatration  waa  Weatern  Union  activity  in  upgrading  the  remote  earth 
atation  monitoring  and  control  equipaMnt  and  reaolving  a  few  known  probleiM 
with  the  earth  atation  equipment  at  the  varioua  aitea.  At  the  time  of 
initial  inatallation  a  temporary  expedient  had  been  taken  for  remote 
monitoring  at  each  of  the  four  aitea,  becauae  the  higher-quality  equipment 
normally  uaed  by  Weatern  Onion  waa  not  available  on  the  neceaaary  t.Um  acale. 
It  waa  agreed  with  Weatern  Union  in  March  that  a  team  would  viait  each  aite 
in  turn  to  complete  the  inatallation,  with  acheduling  and  coordination 
carefully  handled  to  avoid  conflicta  with  teating  of  apeech  equipamnt  for  the 
3  June  demonatration.  Thia  effort  waa  largely  aucceaaful;  the  new  equipment 
waa  put  in  place  at  all  aitea,  and  it  waa  fully  activated  and  checked  out  at 
all  aitea  except  ISI.  Varioua  problema  and  delaya  had  impeded  completion  of 
that  final  atep  until  a  abort  time  before  the  demonatration,  and  Weatern 

Union  waa  aaked  to  defer  further  work  to  give  maximum  time  for  teating  and 
preparation  for  the  demonatration. 

By  agreement  with  Weatern  Union,  it  haa  been  determined  that  Lincoln 
Laboratory  will  act  aa  the  tranamitted  frequency  and  power  reference  atation 
for  the  Wideband  SATNBT.  The  requirement  for  auch  a  reference  ia  baaed  on 
the  fact  that  theae  two  parametera  muat  be  identical  at  all  atationa  in  the 
network,  within  rather  narrow  limita,  to  accommodate  the  burat-to-burat  AGC 
and  AFC  windowa  of  the  BSIa  and  the  FCC  power  allocationa  aaaigned  to  Weatern 
Union.  Initial  alignment  of  theae  parametera  ia  difficult,  involving 


transportation  of  calibration  equipaent  to  each  site,  while  tracking  and 
correcting  long-tera  drifts  pose  additional  probleaa. 

The  chosen  solution  for  these  probleaa  is  to  install  an  accurate 
spectruai  analyser  with  an  intarnal  frequency  reference  at  one  site,  and  to 
periodically  verify  that  each  of  the  other  stations  transaits  at  the  saae 
frequency  and  power,  aa  aeasured  at  the  reference  site.  This  iaplies  that 
signals  arriving  at  the  satellite  will  be  identical,  and  that  the  receiver  at 
any  given  site  (although  it  aay  have  an  uncoapensated  and  uniaportant 
internal  offset)  will  receive  all  the  signals  without  burst-*to-burst 
variation.  Perforaing  these  periodic  checks  will  require  cooperation  between 
BBH  and  Lincoln,  in  that  the  station  under  test  aust  be  the  only  one 
operating  specific  repetitive  data  patterns  that  can  be  conveniently  tracked 
by  the  spectrua  analyser. 

A  aajor  objective  of  current  wideband  experiaental  activity  is  to 
achieve  increased  systea  throughput  in  order  to  support  aultiplesing  of  large 
nuabers  of  users.  The  network  has  typically  been  operated  at  a  burst  rate  of 
772  kbps  up  to  the  present,  thus  keeping  the  bit  error  rate  sufficiently  low 
to  reaove  it  froa  consideration  in  isolating  and  correcting  bugs  in  the 
various  gateways  and  speech  equipaent.  Hore  recently,  with  steadily 
increasing  confidence  in  this  equipawnt,  various  coabinations  of  higher  bit 
rates  with  and  without  coding  are  being  tested  for  extended  periods. 
Concurrently,  experiaents  are  being  planned  and  perforaed  to  coapare  the 
aultiplexing  perforaance  of  the  systea  with  the  predictions  eabodied  in 
H-Note>33,  'Videband  Network  Throughput  Calculations,"  distributed  and 
discussed  at  the  March  Wideband  Meeting.  Figure  9  identifies  the  various 
points  in  the  network  that  can  potentially  liait  systea  throughput,  as 
analysed  in  the  W-Mote,  and  it  is  of  considerable  in^rest  to  verify  these 
issues  as  a  step  in  proposing  systea  iaprovesants.  For  exaaple,  a  recent 
experiaant  was  carried  out  in  which  streaa  capacity  was  requested  for  four 
siaultancous  bA-kbps  full-duplex  conversations  over  the  channel  at  a  burst 
rate  of  772  kbps,  and  the  P8AT  refused  the  request.  Sufficient  streaa 
capacity  for  three,  but  not  four,  conversations  was  successfully  obtained. 
W-Note-33  had  predicted  that  five  such  converations  could  be  supported  under 
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these  circusistances.  Detailed  discuasioa  with  the  BBN  PSAT  teas  on  the 
allocation  of  apace  in  the  P(n)A  fraae  has  allowed  us  to  account  for  the 
difficulty.  Firat,  the  current  fixed-PODA  (FPODA)  iapleaentation  allows  for 
growth  to  10  stations  (rather  than  accowaodating  only  the  current  4)  and 
allots  fixed  reservation  apace  in  the  FPODA  frawe  for  each  of  the  10.  This 
consuaes  about  3,000  of  the  16,384  bita  in  the  PODA  fraae  (at  772  kbps). 

2,000  bits  in  each  fraae  are  reaerved  for  datagraa  (not  streaa  tranaaission) . 
When  all  PODA  and  burat  overhead  is  accounted  for,  the  allowed  atreaa 
capacity  (including  ST  headera)  ia  about  10,500  bits  per  fraae,  or  495  kbps. 
Four  full-duplex  PCN  calls  require  512  kbps,  exclusive  of  ST  headers.  Hence,- 
only  three  can  be  accoaaodated  at  the  present  tiae.  At  a  future  tiaa  to  be 
determined,  BBH  has  agreed  to  cut  back  on  the  reserved  FPODA  reservation  and 
datagraa  space  so  that  four  calls  can  be  accowaodated.  This  will  be  a  useful 
experiment  in  testing  throughput  limits  in  other  parts  of  the  systea. 


Packet  speech  experiaeat  systea  configuration  as  of  June  1982 


VI.  VOICE  CONTROL  OF  NETWORK  VOICE  CONFERENCING 


A.  OVERVIEW 

LPC  and  PCM  conferences  involving  two  to  four  participants  have  been 
successfully  set  up  by  voice  between  two  LEXNETs  connected  through  a  gateway. 
A  conference  initiator  first  dials  the  Voice-Controlled  Operator  (VCOP)  and 
names  conference  participants  and  the  vocoder  type  during  a  voice  dialogue 
with  the  VCOP.  The  VCOP  then  establishes  the  conference  by  ringing  PVT 
telephones  of  conference  participants.  Upon  answering,  each  participant  is 
immediately  part  of  the  conference  and  can  hear  and  talk  to  the  current  set 
of  participants.  A  conference  initiator  can  also  call  up  the  VCOP  and  train 
it  to  recognize  his/her  speech. 

Successful  setup  of  conferences  followed  extensive  tests  of  the  VCOP 
recognition  accuracy  and  development  of  a  human-VCOP  dialogue  controller. 

The  recognition  accuracy  of  the  T580  word  recognition  system  used  in  the  VCOP 
was  evaluated  using  PVTs  on  a  local  LEXNET  and  PCM,  CVSD,  and  LPC  speech 
coding.  Recognition  accuracy  ranged  from  85  to  98  percent  correct.  This 
accuracy  can  support  human-VCOP  voice  interaction  when  talkers  confirm 
recognition  responses.  A  human-VCOP  interaction  protocol  which  performs  well 
at  these  recognition  rates  was  developed  and  tested  with  six  male  talkers. 

All  successfully  set  up  10  conferences,  five  with  6A-kbps  PCM  coding,  and 
five  with  2.A-kbps  LPC  coding.  Conference  setup  times  typically  ranged  from 
40  to  70  s  and  times  to  train  the  VCOP  to  recognize  each  talker's  words 
ranged  from  4  to  6  min. 

B.  RECOGNITION  ACCURACY  OVER  A  LEXNET 

The  recognition  accuracy  of  the  T580  word  recognition  system  in  the  VCOP 
was  evaluated  to  determine  whether  it  can  support  a  conference  setup  dialogue 
with  all  types  of  speech  coding  and  also  to  uncover  any  LEXNET  or  PVT  system 
characteristics  which  need  modification  to  isqprove  recognition  accuracy. 
Performance  was  evaluated  using  a  20-word  vocabulary  (the  digits  zero  to 
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nine,  stop,  start,  yes,  no  go,  help,  erase,  rubout,  repeat,  enter)  four  sale 
talkers,  and  two  fesuile  talkers.  Ten  repetitions  of  each  word  were  used  for 
training.  Each  talker  read  through  the  list  of  20  words  ten  tisws  (200 
utterances)  under  each  test  condition.  Tests  were  performed  on  a  dedicated 
LEXNET  with  two  PVTs.  One  was  a  normal  PVT  used  by  the  talker  and  the  other 
was  a  modified  PVT  which  was  part  of  the  VCOP.  Tests  were  performed  over  the 
LEXNET  using  (1)  64-kbps  PCM  coding  without  silence  detection,  (2)  16-kbps 
CVSD  coding  without  silence  detection,  and  (3)  2.4-kbps  LPC  coding  with 
silence  detection.  In  addition,  recognition  accuracy  was  measured  for  two 
talkers  using  a  direct  microphone  input  to  the  TS80  recognition  system  and 
also  using  the  three  types  of  coding  both  with  and  without  silence  detection. 
A  side  tone  was  present  in  the  talker's  handset  during  all  testa  to  help 
reduce  variations  in  the  input  speech  level  to  the  speech  coding  cards  and 
the  recognition  system. 

Recognition  accuracy  was  highest  (99  percent  correct)  with  direct 
microphone  input.  Accuracy  dropped  slightly  with  PCM  coding  (average  *  94. S 
percent,  range  ■  88  to  99  percent),  with  CVSD  coding  (average  •  92.5  percent, 
range  *  86  to  98  percent)  and  with  LPC  coding  (average  *  91  percent,  range  ■ 
86  to  95  percent).  The  addition  of  silence  detection  in  the  PVT  reduced 
recognition  accuracy  by  roughly  3.5  percentage  points  from  an  average  of  97.5 
percent  to  an  average  of  94  percent  for  all  coding  schemes  for  the  two 
talkers  on  whom  extensive  swasurements  were  made.  Degraded  perfonsance  with 
speech  coding  was  expected  because  of  the  reduced  analog  bandwidth  of  the  PVT 
telephone  coaq>ared  to  the  bandwidth  of  the  close-talking  microphone  supplied 
with  the  recognition  system  (3500  vs  8000  Rs)  and  because  of  the  information 
lost  in  the  coded  speech.  The  addition  of  silence  detection  degraded 
performance  presumably  because  it  prevented  the  complex  word  endpoint 
detection  algorithm  in  the  recognition  system  from  functioning  correctly. 

When  silence  detection  was  active  this  algorithm  was  effectively  replaced  by 
the  simple  silence  detection  algorithms  used  in  the  PVT. 
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C.  PERFORHANCE  OF  HUMAH/VCOP  IHTERACTION  OURING  CONFERENCE  SETUP 

An  effective  VCOP  human  interaction  muat  improve  on  the  recognition 
accuracy  of  the  TS80  recognition  system  (86  to  99  percent)  by  (1)  restricting 
the  set  of  allowable  recognition  responses  at  each  step  of  the  dialogue  and 
(2)  forcing  the  caller  to  verify  all  recognition  responses.  Verification 
involves  aural  echoing  of  the  T580  recognition  responses  using  the  VOTRAX 
synthesizer  followed  by  a  "YES"  or  "NO"  judgment  by  the  caller.  A  conference 
setup  dialogue  with  no  recognition  errors  that  uses  these  techniques  is 
illustrated  in  Fig.  10.  Note  that  any  response  except  "YES"  and  "NO"  is 
echoed  for  verification.  Although  not  shown,  an  incorrect  recognition  result 
with  a  "NO"  verification  causes  the  previous  VCOP  prompt  to  be  repeated.  In 
addition,  a  "HELP":  response  provides  additional  information  about  allowable 
responses  and  a  "UH?",  "WHAT?"  or  any  other  unrecognizable  response  causes 
the  previous  prompt  to  be  repeated. 

The  above  conference  setup  procedure  was  evaluated  using  six  male 
talkers  on  a  dedicated  LEXNET  with  two  PVTs  (one  for  the  talker,  one  part  of 
the  VCOP) .  The  conference  setup  procedure  was  stopped  as  soon  as  the 
initiator- VCOP  dialogue  was  complete.  Each  talker  set  up  ten  two-to-four 
member  conferences.  Half  of  these  were  set  up  using  64-kbps  PCM  coding 
without  silence  detection  and  half  were  set  up  using  2.4-kbps  LPC  coding  with 
silence  detection. 

The  vocabulary  used  during  tests  included  twenty  words:  four  commands 
(YES,  NO,  HELP,  DONE),  three  vocoder  types  (PCM,  LPC,  CVSD),  ten  participant 
names  (Lippman,  Hsiung,  Weinstein,  Feldman,  Gold,  Blankenship,  Forgie, 
O'Leary,  Casner,  Craighill)  and  three  group  names  irtiich  describe  grouping  of 
three  or  more  participants  (24,  network,  speech).  Talkers  initially  trained 
the  recognition  system  by  producing  each  word  five  times. 

Training  tisK  for  the  20-word  vocabulary  was  typically  four  to  six 
minutes.  All  talkers  learned  how  to  set  up  a  conference  with  minimal 
instruction  and  conference  setup  items  typically  ranged  from  40  to  70  s 
depending  on  the  speech  coding  used  and  the  number  of  participants.  All 
conferences  were  set  up  successfully  and  the  VCOP  made  no  fatal  mistakes. 
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The  recognition  accuracy  averaged  over  all  utterances  produced  by  talkers 
during  the  setup  procedure  including  YES/NO  confirmations  was  97.5  percent 
for  PCM  coding  and  96.5  percent  for  LPC  coding.  These  relatively  high  rates 
were  caused  by  the  restricted  vocabulary  sise  used  at  each  stage  of  the  human 
interaction. 

D.  VCOP/NVP  INTERACTION 

PVT  software  was  modified  to  add  new  MVP  tokens  needed  to  set  up 
conferences  with  the  VCOP  and  to  support  communications  between  the  PVT  and 
PDP-11/44  which  are  parts  of  the  VCOP.  The  software  in  the  PVT  that  is  part 
of  the  VCOP  was  modified  to  include  a  low-level  protocol  used  to  communicate 
with  an  11/44  which  is  also  part  of  the  VCOP.  This  11/44  controls  the  T580 
recognition  system  and  the  VOTRAX  synthesiser  using  RS-232  connections  and 
communicates  to  a  PVT  over  a  RS-232  line.  It  answers  the  PVT  phone  when 
someone  dials  the  VCOP,  obtains  conference  setup  information  using  the 
recognition  system  and  the  synthesiser,  and  then  passes  this  information  to 
the  VCOP's  PVT.  The  VCOP's  PVT  sends  new  Please-Join-A-Conference  tokens  to 
PVTs  of  all  conference  participants  after  receiving  conference  setup 
information.  Participant  PVTs  ring  and  when  answered,  establish  point-to- 
point  connections  with  other  conference  participants  after  exchanging  NVP 
tokens  with  the  conference  access  controller.  Participants  then 
autosMtically  join  the  conference  as  soon  as  they  pick  up  their  phones. 

E.  NETWORK  TESTS 

Voice  conferences  have  been  sec  up  between  participants  on  two  LEXRBTs 
at  Lincoln  connected  via  gateways  and  a  PSAT.  The  VCOP  and  conference  access 
controller  were  connected  to  one  LSXNBT  along  with  one  or  two  PVTs.  One  to 
three  PVTs  were  on  the  other  LBXNBT.  PCM  and  LPC  conferences  were 
established  by  voice  by  an  initiator  on  either  the  VCOP's  LBXNBT  or  on  the 
remote  LBXNBT.  Conferences  included  from  two  to  four  participants.  The 
human-VCQP  interaction  was  always  successful.  At  the  end  of  this  interaction 
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PVT  telephones  of  all  participants  rang  and  upon  answering,  each  participant 
immediately  joined  the  conference.  Some  problems  with  the  NVP  protocol  used 
to  establish  and  terminate  the  conference  were  discovered.  The  NVP 
interaction  is  now  being  sMde  more  robust. 
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INITIATOR  DIALS  VCOP  NUMBER  FOLLOWED  BY  USER  ID  NUMBER 

VCOP  ‘HELLO  IS  THIS  MR.  WEINSTEIN? " 

CALLER  “YES" 

VCOP  “WOULD  YOU  UKE  TO  SET  UP  A  CONFERENCE?  ' 

CALLER  “YES" 

VCOP  “VOCODER  TYPE  PLEASE" 

CALLER  ““LPC““ 

VCOP  “  IS  THAT  LPC?““ 

CALLER  “YES  ' 

VCOP  “PLEASE  SAY  PARTICIPANT  OR  GROUP  NAME  OR  DONE " 

CAUER  "SPEECH" 

VCOP  "IS  THAT  SPEECH?  " 

CALLER  “  YES" 

VCOP  "NEXT" 

CALLER  “DONE  " 

VCOP  "IS  THAT  DONE?  " 

CALLER  “YES" 

VCOP  "THANK  YOU,  PLEASE  WAIT" 

PVT  TELEPHONE  OF  EACH  MEMBER  IN  THE  GROUP  NAMED  “  SPEECH  "  IS  RUNG 
MEMBER  AUTOMATICALLY  ENTERS  CONFERENCE  WHEN  PHONE  IS  PICKED  UP 


Fig.  10.  Sample  dialogue  between  a  conference  call  initiator 
and  the  VCOP.  Note  that  the  reply  "speech"  to  the  query  "please 
say  participant  or  group  name  or  done"  refers  to  a  predefined 
group  of  participants  interested  in  speech  processing.  Individual 
names  could  also  have  been  entered. 
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